Similarity Perception of Visual objects: A Machine-Learning Approach

نویسندگان

  • Na-Yung Yu
  • Takashi Yamauchi
  • Ricardo Gutierrez-Osuna
چکیده

Though a traditional assumption in similarity judgment is that people selectively attend to certain features, few studies have explored the actual method of identifying “salient features” in visual stimuli. In this study, we used complex, realistic images to examine whether people selectively process salient features. Stimuli were triads of original and morphed animal face pictures. In the behavioral experiment, participants viewed two original pictures and a morphed composite of the originals and decided which original picture was more similar to the morphed picture. In the computational analysis, we employed Gabor function and wavelets, Gray-Level Cooccurrence Metrics (GLCM) combined with principal component analysis to extract candidate visual features, such as Gabor texture, brightness, size, and contour for the entire face as well as parts of the face. The simulated annealing algorithm was applied to behavioral data to determine possible weight distributions for the candidate features. The analysis suggests that people selectively attend to a few features when comparing visually complex and realistic images. How do people perceive similarity between visual objects? Similarity research has traditionally assumed that people attend to matching and mismatching features selectively (Sloutsky and Fisher, 2004; Tversky, 1977). Although researchers have demonstrated this idea using semantic concepts and their verbal attributes (Lee & Zeinfuse, 2008), few studies have investigated whether people selectively use a small number of features to perceive similarity of complex visual stimuli. In this article, we collected behavioral data from human participants and applied image-processing and machine-learning techniques to identify the visual features that were consistent with similarity judgments from the behavioral data In brief, we present a computational method to find possible weight distributions for candidate features in visual similarity judgment of animal faces. In the behavioral experiment, human participants judged similarity of original and morphed animal pictures. We collected ten original animal face photographs and created five animal pairs (i.e., bear-fox, cow-pig, hipposheep, koala-rat, and lion-horse pair). For each pair, one original picture (i.e., source) was merged with the other original picture (i.e., target) in Morphman 4.0 (2003). Eighteen morphed pictures were generated for each animal pair. Altogether, 90 morphed pictures (5 animal pairs X 18 morphed pictures) and 10 original pictures were used in the experiment (Figure 1). Participants viewed two original pictures of each pair and one morphed picture of the pair and judged which original picture (left or right) was more similar to their morphed composite (for a similar task, see Sloutsky and Fisher, 2004). The proportion of participants selecting the source picture (e.g., the original hippo face for the hippo-sheep pair) was recorded. To obtain a computational analogue of the behavioral data, we first identified 37 potential visual features that our participants might have used for similarity judgment of animal faces. These candidate features included texture difference, relative brightness and size, and contour of the faces. To obtain textural information, we computed Gabor-based textures Figure 1: Sample stimuli

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Online multiple people tracking-by-detection in crowded scenes

Multiple people detection and tracking is a challenging task in real-world crowded scenes. In this paper, we have presented an online multiple people tracking-by-detection approach with a single camera. We have detected objects with deformable part models and a visual background extractor. In the tracking phase we have used a combination of support vector machine (SVM) person-specific classifie...

متن کامل

An Effective Approach for Robust Metric Learning in the Presence of Label Noise

Many algorithms in machine learning, pattern recognition, and data mining are based on a similarity/distance measure. For example, the kNN classifier and clustering algorithms such as k-means require a similarity/distance function. Also, in Content-Based Information Retrieval (CBIR) systems, we need to rank the retrieved objects based on the similarity to the query. As generic measures such as ...

متن کامل

A Novel Method for Tracking Moving Objects using Block-Based Similarity

Extracting and tracking active objects are two major issues in surveillance and monitoring applications such as nuclear reactors, mine security, and traffic controllers. In this paper, a block-based similarity algorithm is proposed in order to detect and track objects in the successive frames. We define similarity and cost functions based on the features of the blocks, leading to less computati...

متن کامل

Visual-Interactive Similarity Search for Complex Objects by Example of Soccer Player Analysis

The definition of similarity is a key prerequisite when analyzing complex data types in data mining, information retrieval, or machine learning. However, the meaningful definition is often hampered by the complexity of data objects and particularly by different notions of subjective similarity latent in targeted user groups. Taking the example of soccer players, we present a visual-interactive ...

متن کامل

Effectiveness of Cognitive Captain's Log Software on Visual-Spatial Perception of Student with Learning Disabilities

Purpose: The purpose of this study was the Effectiveness cognitive Captain's Log software on visual-spatial perception for student with learning disability. Method: This research was a  pretest-posttest design with control group. The statistical population consisted of all students with learning disabilities who were referred to educational and rehabilitation centers of students with specific l...

متن کامل

Machine learning based Visual Evoked Potential (VEP) Signals Recognition

Introduction: Visual evoked potentials contain certain diagnostic information which have proved to be of importance in the visual systems functional integrity. Due to substantial decrease of amplitude in extra macular stimulation in commonly used pattern VEPs, differentiating normal and abnormal signals can prove to be quite an obstacle. Due to developments of use of machine l...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009